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1. The Subject-matter of Game Theory. 
Basic Concepts 



When solving practical (economic, military or other) problems 
one often has to analyse situations in which there are two, or more 
quarrelling parties pursuing conflicting objectives, and where the 
outcome of each action of one party depends on the opponent's 
choice of a course of action. Such situations will be called "conflict 
situations". 

One can cite numerous examples of conflict situations from 
various practical situations. All situations arising in the course of 
military action are conflict situations: each of the contending 
parties takes every available measure to prevent the opponent from 
succeeding. Conflict situations also arise in choosing a weapon or a 
mode of its combat use, and in general, in planning military 
operations; every decision in this area must be made assuming that 
the opponent's action will be the least favourable one. A number 
of economic situations (especially those where there is free competition) 
are conflict situations. The contending parties here are firms, industrial 
enterprises, etc. 

The need to analyse situations of this kind has brought about the 
development of special mathematical methods. The theory of games 
is in fact a mathematical theory of conflict situations. The aim of the 
theory is to elaborate recommendations for each of the opponents 
to act rationally in the course of a conflict situation. 

All conflict situations that occur in practice are very complicated 
and their analysis is hampered by many attendant factors. For 
a mathematical analysis of a situation to be possible, it is necessary 
to disengage oneself from these secondary factors and construct a 
simplified, formalized model of the situation. Such a model will be 
called a "game". 

A game differs from a real conflict situation in that it is 
played according to definite rules. Man has long used such 
formalized models of conflict situations — games in the literal sense 
of the word, for example chess, checkers, card games and so on. Each 
of these games takes the form of a contest proceeding according 
to certain rules and ending in one or another player's "victory" 
(or gain). 

Such formally regulated, artificially arranged games provide the 
most suitable material for illustrating and learning the basic 
concepts of game theory. The terminology borrowed from such 
games is used in the analysis of other conflict situations as well; 



the convention has been adopted of referring to the parties taking 
part in them as "players" and to the outcome of an encounter as a 
party's "gain" or "payoff'. 

A game may be clash of interests of two or more opponents; 
in the first case the game is called a two-person game and in the 
second it is called a multiperson game. The participants of a multiper- 
son game may, in the course of the game, form coalitions, constant or 
temporary. If there are two constant coalitions in a multiperson 
game, it becomes a two-person game. In practice the most important 
games are two-person games. We shall confine ourselves to these 
games only. 

We shall begin our presentation of elementary game theory by 
formulating some basic concepts. We shall consider the two-person 
game in which two players, A and B, with opposing interests 
take part. By a "game" we shall mean an arrangement consisting 
of a number of actions taken by parties A and B. For a game to be 
treated mathematically it is necessary to formulate the rules of the 
game exactly. The "rules of a game" are understood to be a set of 
conditions which regulate the conceivable alternatives for each party's 
course of action, the amount of information each party has about the 
other party's behaviour, the sequence of alternating the "moves" 
(individual decisions made in the course of the game), and the 
result or outcome to which the given totality of moves leads. This result 
(gain or loss) does not always have a quantitative expression, 
but it is usually possible to express the result by a certain 
number by establishing some measuring scale. For example, it 
might be agreed in chess to assign the value + 1 to a victory, the 
value — 1 to a defeat, and the value to a draw. 

A game is called a zero-sum game if one player gains what the 
other loses, i. e. if the sum of both parties' gains is equal to zero. 
In a zero-sum game the players' interests are completely opposed. 
We shall consider only zero-sum games in the following. 

Since in a zero-sum game one player's gain is the other 
player's gain taken with the opposite sign, it is obvious that in 
analysing such a game it is possible to consider just one player's 
gain. Let us take player A, for example. For the sake of convenience, 
in what follows we shall arbitrarily refer to party A as "we" and to 
party B as "the opponent". 

Party A ("we") will always be regarded as the winner and party B 
("the opponent") as the loser. Evidently this formal condition does 
not imply any real advantage to the first player. It can 
easily be seen that it can be replaced by the opposite condition 
if the sign of the gain is reversed. 



We shall consider the development of a game in time as a series 
of successive steps or "moves". In game theory a move is a choice of an 
alternative from the alternatives that are allowed by the rules of 
the game. Moves can be classified as personal or chance moves. 

A personal move is a player's deliberate choice of one of the 
moves possible in the given situation, and its realization. 

An example of a personal move is a move in a game of chess. 
In making his move, a player makes a deliberate choice among the 
alternatives possible for a given disposition of pieces on the chessboard. 

The set of possible alternatives is stipulated for each personal 
move by the rules of the game and depends on the totality 
of both parties' previous moves. 

A chance move is a choice among a number of possibilities 
which is realized not by the player's decision but by some random 
device (the tossing of a coin or a dice, the shuffling and dealing of 
cards, etc.). For example, dealing the first card to a bridge player 
is a chance move with 52 equally possible alternatives. 

For a game to be mathematically definite, the rules must 
indicate for each chance move the probability distribution of the 
possible outcomes. 

Some games may contain only chance moves (the so-called games 
of pure chance) or only personal moves (chess, checkers). Most 
card games are of mixed type, i. e. they consist of both chance 
and personal moves. 

Games are classified not only according to the nature of the moves 
(into personal and chance moves), but also according to the nature 
and amount of information available to either player concerning 
the other's actions. A special class of games is formed by 
"games with perfect information". A game with perfect information 
is a game in which either player knows at each move the results 
of all the previous moves, both personal and chance. Examples of 
games with perfect information are chess, checkers, and the well- 
known game of "noughts-and-crosses". 

Most of the games of practical importance are not games with 
perfect information since the lack of information about the 
opponent's actions is usually an essential element of conflict 
situations. 

One of the basic concepts of game theory is the concept of a 
"strategy". 

A strategy for player is a set of rules unambiguously 
determining the choice of every personal move of the player, 
depending on the situation that has arisen in the course of the game. 

The concept of strategy should be explained in more detail. 



A decision (choice) for each personal move is usually made by 
the player in the course of the game itself depending on the 
particular situation that has arisen. Theoretically, the situation 
will not be altered, however, if we imagine that all the decisions 
are made by the player in advance. To that end the player would have 
to make a list of all situations that might occur in the course 
of the game beforehand and foresee his decision for each of them. 
This is possible in principle (if not in practice) for any game. 
If such a system of decisions is adopted, it means that the player 
has chosen a definite strategy. 

A player who has chosen a strategy may now abstain from taking 
part in the game personally and substitute for his participation a list 
of rules to be applied for him by some disinterested person (a 
referee). The strategy may also be given to an automatic machine in 
the form of a certain programme. It is in this way that modern 
electronic computers play chess. 

For the concept of "strategy" to have sense, a game must have 
personal moves; there are no strategies in games comprising only 
chance moves. 

Games are classified into "finite" and "infinite" ones depending 
on the number of possible strategies. 

A game is said to be finite if either player has only a finite 
number of strategies. 

A finite game in which player A has m strategies and player B n 
strategies is called an m x n game. 

Consider an m x n game between two players, A and B ("we" 
and "our opponent"). 

We denote our strategies by A u A 2 , ... , A m , and our opponent's 
strategies by B u B 2 , ■■-, B„. 

Suppose that either party has chosen a definite strategy; let it be 
Aj in our case, and Bj in our opponent's. 

If the game contains only personal moves, then the choice of strate- 
gies A h Bj unambiguously determines the outcome of the game — our 
gain (payoff). We denote it by a y . 

If the game consists of chance moves as well as personal 
ones, then the gain for a pair of strategies A t and B ; is a random 
quantity depending on the outcomes of all chance moves. In this 
case the natural estimate of the gain expected is its mean value 
(mathematical expectation). We shall denote by the same symbol a y 
both the gain (payoff) itself (in games without chance moves) 
and its mean value (in games with chance moves). 

Suppose we know the value a^ of the gain or payoff (or the 
average values) for each pair of strategies. The values a y can be 
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written in the form of a rectangular array (or matrix) the rows of 
which correspond to our strategies (/!,) and the columns to our 
opponent's strategies (Bj). This array is called a gain or payoff 
matrix, or simply the matrix of the game or game matrix. 
The matrix of an m x n game is of the form: 



/ * 


*. 


B 2 




B„ 


A, 


a \\ 


a \2 




«!/, 


*i 


a 2\ 


a 22 




"In 












"in 


«/»! 


a m l 




"mn 



We shall briefly denote a game matrix by |a,j||. 

Consider a few elementary examples of games. 

EXAMPLE 1. Two players, A and B, without looking at each 
other, each places a coin face up on a table showing either heads 
or tails as they like. If both choose the same side (either heads 
or tails), then player A takes both coins; otherwise they are taken 
by player B. Analyse the game and construct its matrix. 

SOLUTION. The game contains only two moves, our move and our 
opponent's move; both are personal. It is not a game with 
perfect information since at the moment a move is made the 
player who makes it does not know what the other player will do. 

Since either player has only one personal move, a player's 
strategy is a choice for this single personal move. 

There are two strategies for us: choosing heads, A u and 
choosing tails, A 2 ; there are the same two strategies for our opponent : 
heads, B u and tails, B 2 . So this is a 2x2 game. Let a gain of the 
coin count +1. The game matrix is given below. 

This game, simple as it is, may help us to understand some essen- 
tial ideas of game theory. 

First assume that the game is played only once. Then it is 
evidently useless to speak of any "strategies" for the players 
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B t (heads) 


Bj (tails) 


A x (heads) 


1 


-1 


A 2 (tails) 


-1 


1 



being more clever than the others. Either player may equally 
reasonably make either decision. When the game is repeated, however, 
the situation changes. 

Indeed, assume that we (player A) have chosen some strategy (say 
A t ) and are keeping to it. Then from our initial moves our 
opponent will guess what our strategy is and respond to it in the 
manner least advantageous to us, i. e. he will choose tails. 
It is clearly not advantageous for us to always play only one of our 
strategies; in order not to lose we must sometimes choose heads 
and sometimes tails. However, if we alternate heads and tails in any 
definite order (for example, one after the other), our opponent may also 
guess this and counter our strategy in the worst manner for us. 
Evidently a reliable method which would guarantee that our opponent 
is not aware of our strategy is to arrange our choice of each 
move in such a way that we do not know it in advance ourselves 
(this could be ensured by tossing a coin, for instance). Thus, 
through an intuitive argument we have approached one of the essential 
concepts of game theory, that of a "mixed strategy", i. e. a strategy in 
which "pure" strategies — A , and A 2 in this case — are 
alternated randomly with certain frequencies. In this example, it is 
known in advance from symmetry considerations that strategies 
/4, and A 2 should be alternated with the same frequency; in more 
complicated games, the decision may be far from .being trivial. 

EXAMPLE 2. Players A and B each write down simultaneously 
and independently of each other, one of the three numbers: 1, 2, or 3. 

If the sum of the numbers they have written down is even, then 
B pays A that sum in dollars; if the sum is odd, then, on the 
contrary, A pays that sum to B. Analyse the game and construct 
its matrix. 

SOLUTION. The game consists of two moves, both of which 
are personal. We (A) have three strategies: writing down 1, A t ; 
writing down 2, A 2 ; and writing down 3, A 3 . Our opponent 
(B) has the same three strategies. This is a 3 x 3 game with the 
matrix given below. 
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S, 


»7 


B. 


*l 


2 


-3 


4 


*2 


-3 


4 


-5 


*l 


4 


-5 


6 



Evidently our opponent can, as in the previous case, respond 
to any strategy chosen in the way which is worst for us. Indeed, 
if we choose strategy A t , for instance, our opponent will always 
counter it with strategy B 2 ; strategy A 2 will always be countered 
with strategy B 3 and strategy A 3 with strategy B 2 ; thus any 
choice of a definite strategy will inevitably lead us to a loss.* 
The solution of this game (i. e. the set of the most advantageous 
strategies for both players) is given in Section 5. 

EXAMPLE 3. We have three kinds of weapon at our disposal, 
A u A 2 , and A 3 ; the enemy has three kinds of aircraft, B u B 2 , 
and B 3 . Our goal is to hit an aircraft, while the enemy's goal 
is to keep it unhit. When armament A { is used, aircraft B,, B 2 , 
and B 3 are hit with probabilities 0.9, 0.4, and 0.2, respectively; 
when armament A 2 is used, they are hit with probabilities 
0.3, 0.6, and 0.8; and when armament A 3 is used, with probabilities 
0.5, 0.7 and 0.2. Formulate the game in terms of game theory. 

SOLUTION. The situation can be regarded as a 3x3 game 
with two personal moves and one chance move. Our personal 
move is to choose a kind of a weapon; the enemy's personal move 
is to choose an aircraft to take part in the combat. The chance move 
is to select the weapon; this move may or may not end with 
hitting the aircraft. Our gain is unity if the aircraft is hit, otherwise 
it is zero. Our strategies are the three alternative weapons; 
the enemy's strategies are the three alternative aircraft. 

The mean value of the gain in each of the specified pairs of 
strategies is just the probability of hitting a given aircraft with a 
given weapon. The game matrix is given below. 



* One should not forget, however, that our opponent's position is as bad 
as ours. 
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B. 


«2 


«3 


Ax 


0.9 


0.4 


0.2 


A 2 


0.3 


0.6 


0.8 


Al 


0.5 


0.7 


0.2 



The aim of game theory is to work out recommendations for 
the player' rational behaviour in conflict situations, i. e. to determine 
an "optimal strategy" for each player. 

In game theory, an optimal strategy for a player is a 
strategy which, when repeated many times, assures him the maximum 
possible average gain (or, which amounts to the same thing, the 
minimum possible average loss). The argument for the choice of 
this strategy is based on the assumption that the opponent is at least 
as rational as we ourselves are and does everything to prevent us from 
achieving our object. 

All recommendations in game theory are deduced from these 
principles. Consequently, no account is taken of risk elements, 
which are inevitably present in every real strategy, nor of possible 
miscalculations or errors made by the players. 

Game theory, like any mathematical model of a complex 
phenomenon, has its limitations. The most serious limitation is the 
fact that the gain is artificially reduced to only one number. 
In most practical conflict situations, when working out a rational 
strategy one has to take into account several criteria of successful 
action, i. e. several numerical parameters rather than one. A strategy 
optimal according to one criterion is not necessarily optimal 
according to another. However, by realizing these limitations and 
therefore not blindly following the recommendations obtained 
by game theoretic methods, one can still employ mathematical game 
theory techniques to work out a strategy which would at any rate be 
"acceptable", if not "optimal". 



2. The Lower and the Upper Value of the Game. 
The "Minimax" Principle 

Consider an m x n game having the following matrix. 
We shall denote by the letter /' the number of our strategy and 
by the letter j the number of our opponent's strategy. 
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^\-. B 

A ^\^ 


A, 


B, 




B„ 


Ay 


"w 


a n 




"h, 


A-, 


a 2\ 


a 22 




«2» 












A m 


a,„\ 


a„,2 




&nm 



We undertake to determine our optimal strategy. We shall analyse 
each of our strategies sequentially starting with A x . Choosing 
strategy A t we must always count on our opponent responding 
to it with that strategy Bj for which our gain a y is minimal. 
We shall determine this value of the gain, i. e. the smallest of 
the numbers a (j in the ith row. We shall denote it by a f : 

(2.1) 



a ; = min a, 

j 



■j 



Here the symbol min (minimum over j) denotes the minimum 
j 
value of the given parameter for all possible j. 

We shall write the numbers a, next to the matrix above 
in an additional column. 

By choosing some strategy A i we can count on winning, as a 
result of our opponent's rational actions, not more than a ( . 
It is natural that, by acting in the most cautious way and 
assuming the most rational opponent (i. e. avoiding any risk), 
we must decide on a strategy A { for which number a, is maximal. 
We shall denote this maximal value by a: 

a = max a, 

or, taking into account formula (2.1) 

a = max mina n 



The quantity a is called the lower value of the game, or else the maximin 
gain or simply the maximin. 
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B, 


B z 




B„ 


«l 


Ax 


(•w 


«12 




"1.1 


«1 


A 2 


°2\ 


a 22 




«2„ 


«2 














A m 


a,„\ 


a„a 




^mit 


^/i/ 


P, 


P, 


02 




P,„ 





The number a lies in some row of the matrix; player A's 
strategy which corresponds to this row is called a maximin strategy. 

Evidently, if we keep to a maximin strategy, then, whatever our 
opponent's behaviour, we are assured a gain which is in any case 
not less than a. Because of this the quantity a is called the 
"lower value of the game". This is the guaranteed minimum which 
we can assure ourselves by keeping to the most cautious ("play 
safe") strategy. 

Evidently, a similar argument can be carried out for opponent B. 
Since the opponent is interested in minimizing our gain, he must 
examine each of his strategies in terms of the maximum gain for it. 
Therefore we shall write out the maximal values a is for each column at 
the bottom of the matrix: 

(3j = max a tj 



and find the minimal value P ; : 



or 



(3 = minp/ 

,; 



P = min max a ij 



The quantity P is called the upper value of the game, or else 
the "minimax". The opponent's strategy corresponding to the 
minimax gain is called his "minimax strategy". 
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Keeping to his most cautious minimax strategy the opponent 
assures himself the following: whatever move we make against 
him, he will lose a sum not greater than p. 

In game theory and its applications, the principle of cautiousness 
which dictates to the players the choice of the corresponding 
strategies (the maximin and the minimax one) is often called the 
"minimax principle". The most cautious maximin and minimax 
strategies for the players are designated by the general term 
"minimax strategies". 

By way of illustration we shall determine the lower and the 
upper value of the game and the minimax strategies for the 
games in Examples 1, 2, and 3 of Section 1. 

EXAMPLE 1. Example 1 of Section 1 gives a game with the 
matrix given below. 



A ^^-^ 


8, 


h 


C)j 


*\ 


1 


-1 


-1 


A 2 


-1 


1 


-1 


P, 


1 


1 





Since the values a, and p, are constant and are — 1 and + 1 
respectively, the lower and upper values of the game are also 
-1 and + 1: 

0t= -1; p= +1 

Either strategy of player A is a maximin strategy, and either 
strategy of player 6 is a minimax strategy- The conclusion that 
follows is trivial: by adopting any of his strategies, player A 
can assure that his loss is not greater than 1; the same can also 
be guaranteed by player B. 

EXAMPLE 2. Example 2 of Section 1 gives a game with the 
following matrix. 

The lower value of the game a = — 3; the upper value of the 
game P = 4. Our maximin strategy is A , ; employing it systematically 
we can count on a gain not less than —3 (a loss not greater 
than 3). The opponent's minimax strategy is either of the 
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^^\^ B 

A ^^-^^ 


B, 


«; 


«j 


«i 


A\ 


2 


-3 


4 


-3 


*2 


-3 


4 


-5 


-5 


Ai 


4 


-5 


6 


-5 


P, 


4 


4 


6 





strategies Bi or B 2 ; employing them systematically he can in any case 
guarantee that his loss is not greater than 4. If we deviate from 
our maximin strategy (choose strategy A 2 , for instance) our opponent 
may "punish" us for this by playing strategy B 3 and reducing 
our gain to —5; similarly our opponent's deviation from his 
minimax strategy may increase his loss to 6. 

EXAMPLE 3. Example 3 of Section 1 gives a game with, the 
matrix 



^"W B 

A ^"\^ 


s, 


B, 


«J 


«/ 


Ax 


0.9 


0.4 


0.2 


0.2 


Ai 


0.3 


0.6 


0.8 


0.3 


Ai 


0.5 


0.7 


0.2 


0.2 


Pi 


0.9 


0.7 


0.8 





The lower value of the game a = 0.3; the upper value of the 
game p = 0.7. Our most cautious (maximin) strategy is A 2 ' ; 
using weapon A 2 , we guarantee that we hit the aircraft on average 
at least 0.3 of the time. The most cautious (minimax) strategy 
for our opponent is B 2 ; using this aircraft he can be. sure 
that it will be hit at most 0.7 of the time. 
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The last example can be very conveniently used to demonstrate 
one important property of minimax strategies, their instability. 
Suppose we are playing our most cautious (maximin) strategy A 2 , 
while our opponent is playing his most cautious (minimax) strategy 
B 2 . As long as both opponents are keeping to these strategies, the 
average gain is 0.6; it is greater than the lower value of the game, 
but less than its upper value. Suppose now that our opponent 
learns that we are playing strategy A 2 ; he will immediately respond 
to it with strategy B, and reduce our gain to 0.3. We in turn 
have a good answer to strategy 6,, namely strategy A { which 
gives us a gain of 0.9, etc. 

Thus the situation in which both players employ their minimax 
strategies is unstable and may be disturbed by information received 
about the opposite party's strategy. 

There are some games, however, for which minimax strategies 
are stable. These are games for which the lower value is equal to 
the upper one: 

a = p 

If the lower value of the game is equal to the upper one, 
then their common value is called the net value of the game 
(or simply the value of the game); we shall denote it by the 
letter v. 

Consider an example. Let a 4 x 4 game be given by the following 
matrix: 



^^\ B 


«, 


B 2 


B i 


B, 


~/j 


*i 


0.4 


0.5 


0.9 


0.3 


0.3 


^2 


0.8 


0.4 


0.3 


0.7 


0.3 


<*S 


0.7 


0.6 


0.8 


0.9 


0.6 


A A 


0.7 


0.2 


0.4 


0.6 


0.2 


P, 


0.8 


0.6 


0.8 


0.9 
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We find the lower value of the game: 

a = 0.6 

We find the upper value of the game: 

p = 0.6 

They turn out to be the same; consequently, the game has a new 
value equal to a = p = v = 0.6. 

The entry 0.6 marked out in the game matrix is simultaneously 
minimal in its row and maximal in its column. In geometry a point 
on a surface with a similar property (being a minimum for one 
coordinate and simultaneously a maximum for the other) is called 
a saddle point; by analogy, the term is used in game theory as 
well. A matrix element possessing this property is called a saddle 
point of the matrix, and the game is said to possess a saddle 
point. 

Corresponding to a saddle point are a pair of minimax 
strategies (A 3 and B 2 in this example). These are called optimal 
and their combination is called a solution of the game. 

The solution of the game has the following remarkable property. 
If one of the players (for instance, A) keeps to his optimal 
strategy and the other (B) deviates in some way from his optimal 
strategy, then there is never any advantage to the player (B) who has 
made the deviation; such a deviation on the part of player B 
would at best leave the payoff unchanged or at worst increase it. 

On the other hand, if B keeps to his optimal strategy and A 
deviates from his, then there is no possible advantage to A. 

This statement is easy to verify using the example of the game with 
a saddle point that we have been considering. 

We see that in the case of a game with a saddle point minimax 
strategies possess a peculiar "stability": if one party adheres to 
its minimax strategy, then it can be only disadvantageous for the 
other to deviate from his. Notice that in this case neither player's 
knowledge that his opponent has chosen his optimal strategy could 
change his own behaviour: if he does not want to act against 
his own interests, he must adhere to his optimal strategy. A pair 
of optimal strategies in a game with a saddle point are a kind of 
"equilibrium position": any deviation from his optimal strategy 
leads the deviating player to disadvantageous consequences forcing 
him to return to his initial position. 

So for every game with a saddle there is a solution giving 
a pair of optimal strategies of both parties which possesses the 
following properties. 
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(1) If both parties adhere to their optimal strategies, then the 
average gain is equal to the net value of the game, v, which is 
at the same time its lower and upper value. 

(2) If one of the parties adheres to its optimal strategy and the 
other deviates from its, then this would only make the deviating 
party lose and in no case would increase its gain. 

The class of games having a saddle point is of great interest 
from both theoretical and practical points of view. 

Game theory proves that every game with perfect information 
has a saddle point and that consequently every game of this 
kind has a solution, i. e. there is a pair of optimal strategies, 
one for each player which gives an average gain equal to the value 
of the game. If a perfect-information game contains only personal 
moves, then, if each player plays his optimal strategy, the game must 
always end with a definite outcome, namely a gain exactly equal to the 
value of the game. 

As an example of a game with perfect information we mention the 
well-known game of laying coins on a round table. Two players 
put in turn similar coins on a round table, choosing each time an 
arbitrary position for the centre of the coin; no mutual covering 
of the coins is allowed. The victor is the player who puts down the last 
coin (when there is no place to put down any others). It is obvious 
that the outcome of this game is always predetermined, and there 
is a quite definite strategy guaranteeing a sure win for the player 
who is the first to put down a coin. That is, he must put the first 
coin in the centre of the table and then make a symmetrical move 
in answer to each move of his opponent. Here the second player may 
behave in any way he likes without changing the predetermined 
outcome of the game. Therefore this game makes sense only to 
players who do not know the optimal strategy. The situation is 
the same in chess and other games with perfect information; all 
these games have a saddle point and a solution which shows each of 
the players his optimal strategy; no solution of the game of chess 
has been found, only because the number of combinations of 
possible moves in chess is too large for a game matrix to be 
constructed and a saddle point to be found in it. 

3. Pure and Mixed Strategies. The Solution of a Game 
in Mixed Strategies 

Games with a saddle point are rarely met with among finite 
games of practical importance; more typical is the case where the 
lower and the upper value of a game are different. By analysing 
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the matrices of such games we have come to the conclusion that, if 
either player is allowed a choice of only one strategy, then counting on 
a rationally acting opponent this choice must be determined by the 
minimax principle. Keeping to our maximin strategy we clearly 
guarantee ourselves a gain equal to the lower value of the game, a, 
whatever our opponent's behaviour may be. The natural question 
arises: Is it possible to guarantee oneself an average gain greater 
than a if one alternates randomly several strategies rather than playing 
only one "pure" strategy? 

In game theory such combined strategies consisting of several 
pure strategies which are alternated randomly with a definite ratio of 
frequencies are called mixed strategies. 

It is obvious that every pure strategy is a particular case 
of mixed strategy in which all strategies but one are played with 
zero frequencies and the given strategy is played with a frequency 
of 1. 

It turns out that by playing not only pure but also mixed 
strategies one can find a solution for every finite game, 
i i. e. a pair of strategies (mixed strategies in the general case) 
such that, if they are used by both players, the gain will be 
equal to the value of the game, and such that if the optimal 
strategy is deviated from unilaterally by any of the players, the gain 
may change only in the direction disadvantageous to the deviating 
player. 

This statement constitutes the substance of the so-called 
fundamental theorem of game theory which was first proved in 1928 
by von Neumann. The existing proofs of the theorem are rather 
complicated; therefore we shall only state it. 

Every finite game has at least one solution (possibly in the range 
of mixed strategies). 

The gain derived from a solution is called the value of the 
game. It follows from the fundamental theorem that every finite 
game has a value. It is obvious that the value of a game, v, is always 
between the lower value of the game a and the upper value of the 
game (3: 

a «c v < p (3.1) 

Indeed, a is the maximum safe gain that we can assure ourselves 
playing only our pure strategies. Since mixed strategies include all the 
pure strategies as particular cases, by admitting mixed strategies 
as well as the pure strategies, we at any rate do not make our 
possibilities worse; consequently, 

v > a 
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Similarly, considering our opponent's possibilities, we show that 

vsgp 
Inequality (3.1) follows from these. 

We introduce a special notation for mixed strategies. If, for 
example, our mixed strategy consists in playing strategies A u A 2 , 
and A 3 with frequencies p x , p 2 , and p 3 , with p t + p 2 + p 3 = 1, 
we shall denote it by 

C _ I ■"! ^2 ^3 



^Pl P2 f>3, 

Similarly, our opponent's mixed strategy will be denoted by 

^1 &2 B 3 



S„-. 

4l 92 ^3 

where q u q 2 , and q 3 are the frequencies in which strategies 
Bi, B 2 , and B 3 are mixed; qi + q 2 + q 3 = 1. 

Suppose that we have found a solution of the game which consists 
of two optimum mixed strategies, SJ and S$. In the general case, not 
all pure strategies available to a given player enter into his mixed 
strategy. We shall call the strategies entering into a player's optimum 
mixed strategy his "utility" strategies. 

It turns out that the solution of a game possesses still 
another remarkable property : if one of the players keeps to his optimum 
mixed strategy S* (S%), then the gain remains unchanged and equal to 
the value of the game, v, no matter what the other player does, provided 
he keeps within his "utility" strategies. For example, he may 
play any of his "utility" strategies in pure form or mix them in any 
proportions. 

Let us prove this statement. Suppose there is a solution 
S*, SI of an m x n game. For the sake of concreteness, we shall 
consider that the optimum mixed strategy S% consists of a mixture 
of three "utility" strategies, A t , A 2 , and A 3 ; correspondingly, S% 
consists of a mixture of three "utility" strategies B u B 2 , B 3 : 

\Pi Pi Pi) \ii q 2 q 3 

with pi + p 2 + p 3 = 1; qi + q 2 + q 3 = 1. It is asserted that if we 
adhere to strategy S% then our opponent may play strategies 
B lt B 2 , B 3 in any proportions and the gain will remain unchanged 
and be as before equal to the value of the game, v. 

We prove this as follows. Let v u v 2 , v 3 be the gains when 
we play our strategy S3 and our opponent plays his strategies 
Bi, B 2 , and B 3 , respectively. 
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It follows from the definition of an optimal strategy that no 
deviation of our opponent from this strategy S% can be advantageous 
to him, therefore 

Vi #- v; v 2 ^ v; v 3 $5 v 

We shall see whether the value of v,, v 2 , or v 3 can, at least in one 
of the three cases, be found to be greater than v. It turns out that 
it cannot. Indeed, let us express the gain v for the optimal strategies 
S3, S% in terms of the gains v t , v 2 , v 3 . Since in S% strategies 
B u B 2 , and B 3 are played with frequencies q u q 2 , q 3 , we have 

v = Viqi + v 2 q 2 + v 3 <j 3 (3.2) 

(<?i + qi + 93 = 1) 

It is obvious that, if at least one of the quantities v,, v 2 ,"v 3 
were greater than v, then their mean weighted value (3.2) would 
also be greater than v, which contradicts the condition. Thus a 
very important property of optimal strategies is proved; it will be 
extensively used in solving games. 

4. Elementary Methods for Solving Gaines. 
2x2 and 2 x n Games 

If an m x n game has no saddle point, then finding a solution 
is in general a very difficult problem, especially with large m and n. 

Sometimes one can simplify this problem if one can reduce 
the number of strategies by deleting superfluous ones. 

. Strategies are superfluous if they are (a) duplicated or (b) clearly 
disadvantageous. Consider, for example, a game with the following 
matrix: 



^^■v. B 

A ^^^^ 


B, 


Bt 


*! 


B, 


Ax 


1 


2 


4 


3 


A 2 





2 


3 


2 


*i 


1 


2 


4 


3 


A, 


4 


3 


1 
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It can easily be seen that strategy A 3 precisely repeats 
("duplicates") strategy A u therefore any of these two strategies can 
be deleted. 

Further, by comparing the elements of rows A x and A 2 we see 
that each entry in row A 2 is less than (or equal to) the 
corresponding entry in row A x . It is obvious that we should never 
use strategy A 2 \ for it is clearly disadvantageous. Deleting A 3 
and A 2 reduces the matrix to a simpler form (see below). 



A ^\^^ 


B \ 


», 


B, 


B, 


*\ 


1 


2 


4 


3 


A A 


4 


3 


1 






Further notice that strategy B 3 is clearly disadvantageous to our 
opponent; deleting it reduces the matrix to its final form (see 
below). Thus, by deleting the duplicated and clearly disadvantageous 
strategies, the 4 x 4 game is reduced to a 2 x 3 game. 



A ^^-^^ 


B, 


Sj 


fl 4 


A, 


1 


2 


3 


A, 


4 


3 






The procedure of deleting duplicated and clearly disadvantageous 
strategies should always be carried out before solving the game. 

The simplest cases of the finite games that can always be 
solved by elementary methods are 2 x 2 and 2 x m games. 

Consider a 2 x 2 game with the following matrix. Two cases 
can occur here: (1) the game has a saddle point; (2) the game 
has no saddle point. In the first case the solution is obvious, 
it is a pair of strategies intersecting in the saddle point. Notice 
by the way that the presence of a saddle point in a 2 x 2 game 
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--^^ B 


B, 


B, 


*i 


0\\ 


Q| 2 


A 2 


°2\ 


a 22 



always corresponds to the existence of clearly disadvantageous 
strategies. These should be deleted at the outset*. 

Suppose there is no saddle point and, consequently, the lower 
value of the game is not equal to the upper value a # (5. Find 
an optimum mixed strategy for player A: 

s* = ( Ai Al " 

A \Pl P2, 

It is notable for the property that whatever our opponent does 
(provided he does not deviate from his "utility" strategies) the 
gain (payoff) will be equal to the value of the game, v. In a 2 x 2 
game both strategies of our opponent are "utility" strategies, otherwise 
the game would have a solution in the range of pure strategies (there 
would be a saddle point). It means that if we adhere to our optimal 

strategy S* = I ' 2 ) , then our opponent may play any of his 

\Pi PiJ 

pure strategies £,, B 2 without changing the average payoff v. 
Thus we have two equations: 



aiiPi + a 2X p 2 =v 
«2iPi +a 22 p 2 = v 



(4.1) 



from which, taking into account the fact that p x + p 2 = 1, we get 
flnPi + a 2i (l - p 1 ) = a 12 p 1 + a 22 (l - p t ) 



Pi =■ 



a 2 i -a 21 



+ aii - o-\i -«2i 



(4.2) 



The value of the game v is found by substituting the values of 
p y , p 2 into any of the equations (4.1). 



* It is left for the reader to check this for himself with a number of 
matrices. 
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If the value of the game is known, then one equation is enough 
to determine our opponent's optimal strategy S% = ( ! 2 I, for 

\9i w 



example: 



anqi +a l2 q 2 = v 



from which, considering that q x + q 2 = 1, we have 



v-a 12 

<?i = - — ~ qi = l~qi 

an -an 



EXAMPLE 1. We find the solution of the 2 x 2 game considered 
in Example 1 of Section 1 having the following matrix. 



A ^~~^\^^ 


B, 


B : 


*\ 


1 


-1 


A 2 


-1 


1 



The game has no saddle point (a = — 1 ; P = +1) and, consequently, 
the solution must lie in the range of mixed strategies: 

\Pi PiJ \qi qi, 

It is necessary to find p u p 2 , q u and q 2 . 
For Pi we have the equation 

1 x p, + (- 1) x (1 - p,) = (- 1) x p, + 1 x (1 - Pl ) 
from which 

Pl = l /l P2 = l l 2 



Similarly we find 



<h = V 2 q 2 = 1 h v = 



Consequently, the optimal strategy for either player is to alternate 
his pure strategies at random, employing each as often as the other; 
here the average payoff will be zero. 

The conclusion we have come to was sufficiently clear beforehand. 
In the next example we shall consider a more complex game whose 
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solution is not so obvious. The example is an elementary game of 
the sort known as "bluffing" games. It is common practice in conflict 
situations to use various methods of bluffing, or misleading one's 
opponent (misinformation, arrangement of dummy targets, etc.). The 
example is quite instructive in spite of its simplicity. 

EXAMPLE 2. The game is as follows. There are two cards, 
an ace and a deuce. Player A draws either of the two at random; 
B does not see which card is drawn. If A has drawn the ace, 
he says "I've got the ace" and demands a dollar from his 
opponent. If A has drawn the deuce, then he may either (At) 
say "I've got the ace" and demand a dollar from his opponent or 
(A 2 ) confess that he has got the deuce and pay his opponent a 
dollar. 

The opponent, if he is paid the dollar voluntarily, can only 
accept it. If, however, a dollar is demanded from him, then he may 
either (B t ) believe that player A has got the ace and give him 
the dollar or (B 2 ) demand a check so as to see whether -4's 
statement is true or not. If it is found that A does have the ace, 
B must pay A two dollars. If, however, it is found that A is 
bluffing B and has the deuce, player A pays B two dollars. 

Analyse the game and find the optimal strategy for each 
player. 

SOLUTION. The game has a comparatively complex structure; 
it consists of one obligatory chance move — player A's choice of 
one of the two cards — and two personal moves which may or may not 
be realized, however. Indeed, if A has drawn the ace, he makes no 
personal move: he is given only one possibility, that of demanding a 
dollar, which he does. In this case the personal move — to believe 
or not to believe (i. e. to pay or not to pay the dollar) — is 
passed to player B. If A has got the deuce as a result of his 
first chance move, then he is given a personal move: to pay a 
dollar or try to bluff his opponent and demand a dollar (in 
short, "not to bluff or "to bluff). If A decides on the first 
choice, then it remains for B to accept the dollar; if A decides 
on the second choice, then player B is given a personal move: 
to believe or not to believe A (i. e. to pay A a dollar or demand 
a check). 

The strategies of each of the players are the rules indicating 
how the player should act when given a personal move. 

Evidently A has only two strategies: 

to bluff, A x and not to bluff, A 2 . 

B has also two strategies: 

to believe, B u and not to believe, B 2 . 
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We construct the matrix of the game. To this end we calculate 
the average payoff for each combination of strategies. 

1. A t Bi (A is bluffing, B believes). 

If A has got the ace (the probability of which is l / 2 ), then 
he is not given any personal move; he demands a dollar and 
player B believes him; the payoff is 1. 

If A has got the deuce (the probability of which is also l / 2 ), 
then in accordance with his strategy he bluffs and demands a 
dollar; B believes him and pays him the money; A's payoff is 
also 1. 

The average payoff is 

a u = l / 2 x \ + l / 2 x I = 1 

2. A X B 2 (A is bluffing, B does not believe). 

If A has got the ace, he has no personal move; he demands a 
dollar; in accordance with his strategy, B does not believe A and as a 
result of the check pays A two dollars (A's payoff is +2). 

If A has got the deuce, then in accordance with his strategy 
he demands a dollar; B, in accordance with his, does not believe 
him; as a result A pays him two dollars (A's payoff is -2). 

The average payoff is 

0,2 = 72* ( + 2)+ 7 2 x(-2)=0 

3. A 2 B l (A is not bluffing, B believes). 

If A has drawn the ace, he demands a dollar; in accordance 
with his strategy, B pays the money; A's payoff is 1. If A has 
drawn the deuce, then in accordance with his strategy he pays B a 
dollar; it remains for B only to take the money (A's payoff is 
— 1). The average payoff is 

«2, = 7 2 x(+i)+72 x(-i) = o 

4. ^2^2 (A is not bluffing, B does not believe). 

If A has drawn the ace, he demands a dollar; B checks and as a 
result pays A two dollars (the payoff is +2). 

If A has drawn the deuce, he pays B a dollar; it only remains 
for B to take the money (the payoff is — 1). 

The average payoff is 

a22 = 7 2 x( + 2)+72X(-l)=72 

We construct the matrix of the game (see below). It has no 
saddle point. The lower value of the game is a = 0; the upper value of 
the game is p = 1 / 2 . We find the solution of the game in the range 
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^r_-^_ 


B, (to believe) 


B-, (not to believe) 


A { (to blufO 


1 





A 2 (not to blufO 





72 



of mixed strategies. Applying formula (4.2), we have 

'/? _ 1/ „ _ 2/ o*_Ml ^2 



Pi = 



i. e. player /I should play his first strategy (which is to bluff) 
one-third of the time and his second strategy (not to bluff) two- 
thirds of the time. Then he will gain on average the value of the 
game 

v = 7a 

The value v = V3 implies that under these circumstances the game 
is advantageous to A and disadvantageous to B. Playing his optimal 
strategy, A can always assure himself a positive average payoff. 

Notice that if A played his most cautious (maximin) strategy- 
(both strategies, A x and A 2 , are maximin in this case), he would 
have an average payoff equal to zero. Thus a mixed strategy 
enables A to realize his advantage over B arising from the 
given rules of the game. 

We determine the optimal strategy for B. We have 

<j! x 1 + q 2 x = 7s q x = 7s <?2 = 7s 



whence S% = 



B x B : 

7 3 7 



i. e. player B should believe A and pay 



him a dollar without a check one-third of the time and should 
check two-thirds of the time. Then he will on average lose 73 of a 
dollar for each game. If he played his pure minimax strategy B 2 
(not to believe), then he would on average lose 72 of a dollar 
for each game. 

The solution of a 2 x 2 game can be given a simple geometrical 
interpretation. Let there be a 2x2 game with the matrix below. 

Take a section of the axis of abscissae of length 1 (Fig. 4.1). The 
left-hand end of the section (the point with the abscissa x = 0) will 
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A ^~^^_^ 


B, 


*2 


Ax 


Oil 


fl 12 


A 2 


a 2\ 


«22 



represent strategy /li; the right-hand end of the section (x = 1) will 
represent strategy A 2 . Draw through the point A l and A 2 two 
perpendiculars to the axis of abscissae; the axis /-/ and the axis 
II-II. We shall use the axis I-I to plot payoffs for strategy 




Fig. 4.1 



Ai and the axis //-// to plot payoffs for strategy A 2 . Consider 
our opponent's strategy B u it gives two points on the axes /-/ and 
//-// with the ordinates a n and a 2l , respectively. Draw through 
these points the line B l B l . It is obvious that if we play the mixed 

strategy S A = I ' 2 I against our opponent's strategy B x then 

\Pi Pz/ 
our average payoff, equal to flnPi + a X2 Pi in this case, will be 
represented by the point M on the line B { B U the abscissa of the point 
is equal to p 2 . We shall adopt the convention to refer to the line 
BiBi representing the payoff when strategy B t is played as "strategy 
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Fig. 4.2 




Fig. 4.3 



Evidently, in precisely the same way it is possible to construct 
strategy B 2 as well (Fig. 4.2). 

We must find an optimal strategy S*, i. e. such that the minimum 
payoff for it (whatever B's behaviour) would be maximized. To 
this end, construct the lower bound for the payoff for strategies 
B], B 2 , i. e. the broken line B i NB 2 marked in bold type in Fig. 4.2. 
This lower bound will express the minimum payoff for any of player 
y4's mixed strategies; it is the point N at which this minimum payoff 
attains the maximum that determines the solution and the value of 
the game. It is easy to make sure that the ordinate of the point N 
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is the value of the game, v, and that its abscissa is equal to p 2 , 
the relative frequency with which strategy A 2 is played in the 
optimum mixed strategy S*. 

In our case the solution of the game was given by the point of 
intersection of the strategies. This is not always the case, however; 
Fig. 4.3 shows a case where in spite of the presence of an intersection 
of the strategies the solution gives pure strategies (A 2 and B 2 ) 
for both players, and the value of the game v = a 22 . 

In this case the matrix has a saddle point, and strategy A { is 
clearly disadvantageous, since, whatever our opponent's pure strategy, 
it gives a smaller payoff than A 2 . 

In the case where our opponent has a clearly disadvantageous 
strategy, the geometrical interpretation is of the form shown in Fig. 4.4. 




Fig. 4.4 



In this case the lower bound of the payoff coincides with strategy 
6,; strategy B 2 is clearly disadvantageous to our opponent. 

Geometrical interpretation makes it possible to represent 
graphically the lower and the upper value of the game as well 
(Fig. 4.5). By way of illustration we shall construct geometrical 
interpretations for the 2x2 games of Examples 1 and 2 (Figs. 4.6 
and 4.7). 

We have made sure that any 2x2 game can be solved by elementary 
methods. It is possible to solve in a quite similar way any 2 x n 
game where we have only two strategies and our opponent has an 
arbitrary number of strategies. 
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Fig. 4.5 
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Fig. 4.6 



Suppose we have two strategies A x , A 2 at our disposal and our 
opponent has n strategies: B u B 2 , .-., B„. The matrix ||ay|| is 
given; it consists of two rows and n columns. As in the case of two 
strategies we give a geometrical interpretation of the problem; our 
opponent's n strategies will be represented by n straight lines (Fig. 4.8). 
We construct the lower bound for the payoff (the broken line 
B^MNB 2 ) and find in it the point N with the maximum ordinate. 
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Fig. 4.8 



This point gives a solution to the game ( strategy S% = 



Pi Pi 



; the ordinate of the point N is equal to the 



value of the game, v, and the abscissa is equal to the relative 
frequency p 2 of strategy A 2 . 

In the case at hand our opponent's optimal strategy results from 
playing a mixture of two "utility" strategies, B 2 and B 4 , intersecting in 
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the point N. Strategy B 3 is clearly disadvantageous and strategy B, 
is disadvantageous when the optimal strategy S% is played. If A adheres 
to his optimal strategy, then the payoff will remain unchanged no 
matter which of his "utility" strategies B may play; it will 
change, however, if B passes to his strategies Bj or B 3 . 

It is proved in game theory that every m x n game has a solution 
in which the number of "utility" strategies for either party does 
not exceed the smaller of the two numbers m and n. From this 
it follows, in particular, that a 2 x m game has always a solution in 
which there are at most two "utility" strategies for either party. 

By making use of a geometrical interpretation it is possible 
to give a simple method for solving any 2 x m game. From the 
graph we find a pair of our opponent's "utility" strategies, Bj 
and B k , intersecting in the point N (if there are more than two strategies 
intersecting in the point N, we take any two of them). We know that 
if player A keeps to his optimal strategy, then the payoff does not 
depend on the proportion in which B plays his "utility" strategies; 
consequently 

Piaij + p 2 i2j = v } 

Pl^U + P2«2* = v J 

From these equations and the condition p 2 = 1 — P\ we find 
p u Pi and the value of the game, v. 
Knowing the value of the game, one can at once determine player 

Bj B k 



B's optimal strategy S% = 

V <ij Ik 

To do this, one solves, for example, the equation 

<?/Zu + <?*au = v 
where 

Qj + <lk = l 

In the case where we have m strategies at our disposal while our 
opponent has only two, the problem is obviously solved in a quite 
similar way; suffice it to say that by changing the sign of the 
payoff it is possible to turn player A from the "winner" into 
the "loser". It is also possible to solve the game without changing 
the sign of the payoff, then the problem is solved for B directly, but the 
upper bound of the payoff is constructed rather than the lower 
bound (Fig. 4.9). One looks for the point N on the bound with the 
minimum ordinate, which is precisely the value of the game, v. 
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Fig. 4.9 

We shall consider and solve several examples of 2 x 2 and 
2 x m games which are simplified cases of games of practical 
importance. 

EXAMPLE 3. Side A is sending two bombers, / and //, to the 
enemy's assembly area; / is flying in front, and // is flying behind. One 
of the bombers, which precisely it is not known beforehand, is 
carrying a bomb, the other is acting as an escort. 

Over the enemy's area the bombers are attacked by side 
B's fighter. The bombers are armed with guns having different 
rates of fire. If the fighter attacks the back bomber //, then it is 
fired at only by this bomber's guns; if, however, it attacks the 
front bomber, then it is fired at by both bombers. The fighter 
is hit with a probability of 0.3 in the first case and that of 0.7 in the 
second. 

If the fighter is not shot down by the defensive fire of the 
bombers, then it hits the target it chooses with a probability of 
0.6. The task of the bombers is to carry the bomb to the target; 
the task of the fighter is to prevent this, i. e. to shoot down the bomber 
carrying the bomb. Choose optimal strategies for each side: 

(a) for side A: which bomber is to carry the bomb? 

(b) for side B: which bomber is to be attacked? 
SOLUTION. We have here a simple case of a 2x2 game; the 

payoff is the probability of the bomber carrying the bomb 
remaining unhit. 

Our strategies are: 

A\. bomber / is to carry the bomb; 
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A 2 : bomber // is to carry the bomb. 
The enemy's strategies are: 
B\ m . bomber / is to be attacked; 
B 2 : bomber // is to be attacked. 

We set up the matrix of the game, i. e. find the average payoff 
for each combination of strategies. 

1. A t Bi (I is carrying a bomb, / is attacked). 

The bomber carrying a bomb will not be hit, if the fighter is 
shot down by the bombers or if it is not shot down but misses 
its target: 

fln =0.7 + 0.3 x 0.4 = 0.82 

2. A 2 B X (II is carrying the bomb, J is attacked). 

021 = 1 

3. A l B 2 (I is carrying the bomb, II is attacked) 

a, 2 = 1 

4. A 2 B 2 (II is carrying the bomb, // is attacked) 

a 22 = 0.3 + 0.7 x 0.4 = 0.58 

The matrix of the game is of the form 



^~\-^ B 


B, 


*2 


Ax 


0.82 


1 


A 2 


1 


0.58 



The lower value of the game is 0.82; the upper value is /. The 
matrix of the game has no saddle point; we seek the solution 
in the range of mixed strategies. 
We have 

Pi x 0.82 + p 2 x 1 = v 

Pi x 1 + Pi x 0.58 = v 
p 2 = l 



Those equations yield 
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Pi 



Pi = 0.7 p 2 = 0.3 



Our optimal strategy is 

'A, A 2 
,0.7 0.3 



c* _ 
•*A — 



i. e. bomber / should be chosen to carry a bomb more often 
than //. The value of the game is 

v = 0.874 

Knowing v we determine q l and q 2 , the relative frequencies for 
strategies Bj and B 2 in the enemy's optimal strategy S%. We have 

q x x 0.82 + q 2 x 1 = 0.874 
<72 = 1 -«i 

thence 

«, =0.7 q 2 =0.3 

i. e. the enemy's optimal strategy is 

iB ~\0.7 0.3 




Fig. 4.10 



EXAMPLE 4. Side A is attacking a target, side B is defending 
it. Side A has two aircraft; side B has three anti-aircraft guns. 
Each aircraft carries a powerful weapon; for the target to be hit, it is 
enough for at least one aircraft to break through to it. Side 
A's aircraft may choose any of the three directions, /, //, or /// 
(Fig. 4.10). 

The enemy (side B) may place any of its guns in any 
direction, each gun raking only the area of space covering one 
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direction and leaving unraked the other directions. Each gun may 
fire at one aircraft only; the aircraft fired at is hit with a 
probability of 1. Side A does not know where the guns are 
placed; side B does not know from where the aircraft will 
appear. Side A's task is to hit the target; side B's task is to 
prevent it from being hit. Find the solution of the game. 

SOLUTION. This game is a 2 x 3 game. The payoff is the probability 
of hitting the target. Our possible strategies are: 

Ai\ to send one aircraft in one of the three directions and one 
in another; 

A 2 : to send both aircraft in one of the directions. 

The enemy's strategies are: 
" B,: to place a gun in each of the directions; 

B 2 : to place two guns in one of the directions and one in another; 

B 3 : to place all the three guns in one direction. We set up the matrix. 

1. AiBt (the aircraft are flying in different directions; the guns 
are placed one by one). Clearly no aircraft will break through to 
the target in this situation: 

a„=0 

2. A 2 Bi (the aircraft are flying together in one direction; the 
guns are placed one by one). In this situation, one aircraft 
will evidently pass to the target without being fired at: 

a 21 = 1 

3. AiB 2 (the aircraft are flying one by one, the enemy is 
protecting two directions, leaving a third unprotected). The 
probability of at least one aircraft breaking through to the target 
is just the probability of one of them choosing the unprotected 
direction: 

4. A 2 B 2 (the aircraft are flying together in one direction; the 
enemy is protecting one direction with two guns and another with 
one gun, i. e. it is protecting one direction, leaving the other 
two practically unprotected). The probability of at least one aircraft 
breaking through to the target is just the probability of the 
pair of aircraft choosing a practically unprotected direction: 

an = 2 / 3 

5. A x Bi (the aircraft are flying separately, the enemy is protecting 
only one direction with the three guns) 

013= 1 

40 



6. A 2 B 3 (the aircraft are both flying together; the enemy is 
protecting only one direction with the three guns). For the target 
to be hit, the aircraft should both choose an unprotected direction: 



«23 



= V 3 



The matrix of the game is as follows: 



^^-^. B 


A, 


B, 


h 


*\ 





2 h 


1 


*2 


1 


2 h 


2 /3 



It can be seen from the matrix that strategy B 3 is clearly 
disadvantageous in comparison with B 2 (this could have been 
determined beforehand). Deleting strategy B 3 reduces the game to a 
2x2 one: 





*. 


B, 


Ax 





2 h 


*i 


1 


2 h 



The matrix has a saddle point; the lower value of the game 
(2/3) coincides with the upper one. 

Notice simultaneously that strategy A x is clearly disadvantageous 
to us (A). The conclusion is that both sides, A and B, should 
always play their pure strategies A 2 and B 2 , i- e. we should send the 
aircraft in twos, choosing randomly the direction in which to send the 
pair; the enemy should place its guns as follows: two in one 
direction, one in another; the choice of these directions should 
also be made at random (here "pure strategies" are seen to 
include an element of chance). Playing these optimal strategies, we 
shall always get an average payoff of 2 / 3 (i. e. the target will 
be hit with a probability of 2 / 3 ). 
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Notice that the solution of the game we have obtained is 
not unique; besides a pure strategy solution, there are a whole 
range of mixed solutions for player A that are optimal, from 
Pi = to pi = 1 / 3 (Fig. 4.11). For example, it is easy to see directly 
that the same average payoff of 2 / 3 will result if we play our 
optimal strategies A l and A 2 in the ratio of 1 / 3 and 2 / 3 . 




Fig. 4.11 

EXAMPLE 5. The conditions are the same as in the preceding 
example, but we may strike a blow from four directions and 
the enemy has four guns at its disposal. 

Solution. As before we have two possible strategies: 

Ai 4 . to send the aircraft one by one; 

A 2 : to send both aircraft together. 

The enemy has five possible strategies: 

Bi(l + 1 4- 1 + 1): to place a gun in each direction; 

B 2 (2 + 2): to place two guns in one direction and two 

in some other; 
B 3 (2 + 1 + 1): to place two guns in one direction and one 

in each of some other two; 
B 4 (3 + 1): to place three guns in one direction and one 

in some other; 
B 5 (4): to place all the four guns in one direction. 

Discard strategies B 4 , B 5 beforehand as clearly disadvantageous. 
Arguing as in the previous example, construct the matrix of 
the game: 
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A ^^^^ 


B, (1 + 1 + 1 + 1) 


B : (2 + 2) 


By (2+1 + 1) 


A, 





5 / 6 


V: 


A 2 


1 


'/2 


V4 



The lower value of the game is 1 / 2 , the upper one is 3 / 4 . 

The matrix has no saddle point; the solution lies in the range 

of mixed strategies. Making use of the geometrical interpretation 

(Fig. 4.12), we single out the enemy's "utility" strategies: B y and B 2 . 



1 








1 


I 

>B, 


B 2 < 






,N^ 




,B, 


B 3 < 






1 V 




,B 2 


»i 




' s /a 




J4_ 







(A,) 




-O — 


< 


(A 2 ) x 



Fig. 4.12 



The frequencies p x an d Pj are determined from the equations 

p, x + (1 - p^ x 1 = v 

Pt x s / 6 + (l - Pl ) x V 2 =v 



thence 



P. = 3 /s 

P2 = 78 

v = 5 /s 



i. e. our optimal strategy is 



S* = 



Ai A; 
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Playing it we assure ourselves an average payoff of 5 / 8 . Knowing 
the value of the game v = 5/8, we find the frequencies q x and q 2 
for the enemy's "utility" strategies: 

«.xO + (l-?,)x 7e = 7s 

<Jl = 74 <?2 = 74 

The optimal strategy for the enemy will be 

s*-( Bl * 2 

EXAMPLE 6. Side A has two strategies, A x and A 2 , and side B 
has four, B lt B 2 , B 3 and 6 4 . The matrix of the game is of the 
form: 





B, 


B, 


B. 


B, 


^ 


3 


4 


10 


12 


*2 


8 


4 


3 


2 



Find the solution of the game. 

SOLUTION. The lower value of the game is 0.3; the upper 
value is 0.4. The geometrical interpretation (Fig. 4.13) shows that 
player B's utility strategies are B, and B 2 or B 2 and B 4 . 
Player A has an infinite number of optimum mixed strategies: in an 
optimal strategy, p, may change from 1/5 to 4/5. The value of the game 
v = 4. The pure optimal strategy for player B is B 2 . 




Fig. 4.13 
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5. General Methods for Solving Finite Games 

So far we have considered only the most elementary games of 
the 2 x n type which are very easy to solve and allow convenient 
and graphic geometrical interpretation. 

In the general case, solving an m x n game is a rather difficult 
problem, as the complexity of the problem and the amount 
of calculation involved sharply increases with increasing m and n. 
These difficulties are not fundamental, however, and are associated 
only with the very large amount of calculation involved, which in 
some cases may prove impracticable. Essentially, the method of 
solution remains the same for any m. 

To illustrate, consider an example of a 3 x n games. We shall 
give it a geometrical interpretation, a three-dimensional one this 
time. Our three strategies, A lt A 2 and A 3y will be represented 
by three points in the xOy plane; the first lies at the origin of 
coordinates (Fig. 5.1), the second and the third are in the Ox 
and Oy axes at distances of 1 from the origin. 

Through the points A u A 2 , and A$, the axes /-/, //-//, 
and Ill-Ill are drawn perpendicular to the xOy plane. The axis 
/-/ is used to plot the payoff for strategy A l and the axes 




Fig. 5.1 
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II -I I and I II -1 1 1 to plot the payoffs for strategies A 2 and A 3 . 
Each of our opponent's strategies, B jt will be represented by a plane 
intercepting along the axes /-/, //-//, and III-III segments 
equal to the payoffs for the corresponding strategies A u A 2> A 3 , 
and strategy Bj. By constructing in this way all of our opponent's 
strategies, we obtain a family of planes over the triangle 
A lt A 2 , A 3 (Fig. 5.2). It is possible to construct the lower bound 




of the payoff for this family, in the same way as we did in the 
2 x n case and find the point N on that bound having 
maximum altitude above the xOy plane. This altitude is the value 
of the game, v. The frequencies p u p 2 , p 3 for the strategies 
A u A 2 , A 3 in the optimal strategy S* will be given by the 
coordinates (x, y) for the point N, namely 

p 2 = x p 3 = v pi = 1 - p 2 - Pi 

However, such a geometrical representation is not easy to 
visualize even for the 3 x n case and requires a lot of time and 
imaginative effort. In the general case the representation is in m- 
dimensional space and our geometrical intuition fails, although in 
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some cases it may be found helpful to employ geometrical 
terminology. When solving m x n games in practice, it is more 
convenient to use computational analytic methods rather than 
geometrical analogies, especially as these methods are the only 
methods suitable for solving problems on computers. 

All these methods essentially reduce to solving problems by 
successive sampling, but the ordering of a sequence of samples makes 
it possible to construct an algorithm leading to a solution in the 
most economical way. 

Here we shall dwell briefly on a calculation procedure for 
solving m x n games — the so-called "linear programming" method. 

To this end we shall first give a general statement of the 
problem of finding the solution of an m x n game. Given anmxn 
game with m strategies, A u A 2 , ..., A m for player A and n 
strategies, B,, B 2 , ..., B„, for player B, and payoff matrix 
||a w ||, it is required to find a solution of the game, i. e. two 
optimum mixed strategies for players A and B 

(A, A 2 A m \ ( Bl B 2 ...B, 

\ Pi Vl--- PmJ \<?1 Q2 ■■■ % 

where p t + ... + p m - 1 ; q x + . . . + q„ = 1 (some of the numbers 
Pi and q } may be equal to zero). 

Our optimal strategy S3 must assure us a payoff not less than v 
whatever our opponent's behaviour, and a payoff equal to v if 
our opponent's behaviour is optimal (strategy S%). Similarly strategy 
S% must assure our opponent a loss not greater than v whatever 
our behaviour, and equal to v if our behaviour is optimal (strategy 

S% 

In this case we do not know the value of the game, v; we shall 
assume that it is equal to some positive number. This does not 
violate the generality of the reasoning; to have v > 0, it is evidently 
sufficient for all elements of matrix ||a y || to be nonnegative. 
This can always be attained by adding to elements ||a y || a 
sufficiently large positive value L; then the value of the game will 
increase by L while the solution will remain unchanged. 

Suppose we have chosen our optimal strategy S% Then our 
average payoff, when our opponent plays his strategy Bj, will be 



Our optimal strategy S% has the property such that, whatever our 
opponent's behaviour, his strategy assures us a payoff not less than v; 
consequently, it is impossible for any of the numbers a, to be less 
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than v. A number of conditions result: 

Pl^ll + P2«21 + ... + p m a mi 5* v 
Pia i2 + p 2 a 22 + ... + p m a m2 ^ v 



(5.1) 



Pl«ln + P2«2n + ■ • ■ + Pm^mn > V 

We divide inequalities (5.1) by a positive value v and denote 

P./V = %x Plh = \l Pm/V = \ m 

Then conditions (5.1) can be written down as 

flllSl +021^2 + ••• +a m \km> 1 

a, 2^1 + fl 2 2^2 + ... + a m2 % m > 1 . 

(5.2) 

ai-^i + «2»^2 + • • • + a m ,£ m > 1 

where £i, t, 2 , . .., % n are nonnegative numbers. Since Pi + p 2 + ■■■ 
• •• + Pm— 1) the variables £,, £ 2 > •••> ^m satisfy the condition 

^1+^2 + ..+ 5-= 1/v (5.3) 

We want to make our safe payoff as large as possible. The 
right-hand side of equation (5.3) will then assume a minimal value. 

Thus the problem of finding the solution of the game is reduced 
to the following mathematical problem: determine the nonnegative 
variables £, x , i, 2 , ... , % m satisfying conditions (5.2) so that their sum 

* = Si +^2 + ••• +4™ 

should be minimal. 

It is usual in solving extremal (maximum and minimum) problems 
to differentiate the function and set the derivatives to zero. But 
such a method is useless in this case, since the function <t> 
to be minimized is linear and its derivatives with respect to all the 
arguments are equal to unity, i. e. vanish nowhere. Consequently, 
the maximum of the function is attained somewhere on the 
boundary of the argument range, which is determined by the 
requirement for the nonnegativity of the arguments and by conditions 
(5.2). The method of finding extreme values by differentiation is 
also useless in cases where, in order to solve the game, the maximum 
of the lower (or the minimum of the upper) bound of the payoff is 
determined, as when solving 2 x n games for instance. Indeed, the 
lower bound is composed of line segments and the maximum is 

48 



attained on the boundary of the interval or at the point of 
intersection of line segments rather than at the point where the 
derivative is equal to zero (there is no such point at all). 

To solve such problems, which occur rather often in practice, 
special linear programming techniques have been developed by 
mathematicians. 

A linear programming problem is set as follows. 

Given a system of linear equations 



"12^1 + 122^2 + • • • + a m l%m = b 2 



«r^i + a 2 £ 2 + ■■■ + a mn t, m = b n 



(5.4) 



Find nonnegative values for the variables £ 1( £ 2 , ... ^ m which 
satisfy conditions (5.4) and at the same time minimize the given 
homogeneous linear function of the variables iji, !; 2 , ..., Z, m (the 
linear form): 

* = c 1 ^ 1 + c 2 ^ 2 + ... +c m % m 

It can be easily seen that the game theory problem given previously 
is a particular case of the linear programming problem with 
Ci = c 2 = . . . = c m = 1. 

It may at first sight seem that conditions (5.2) are not 
equivalent to conditions (5.4), since they contain inequality signs 
instead of equality signs. It is easy, however, to get rid of the 
inequality signs by introducing new dummy nonnegative variables 
Zj, z 2 , -.., z„ and writing conditions (5.2) as 

fln^i +«2i^2 + ■■■a m \km - -i ^1 
012S1 + 022^2 + • • • + a m2 q,„, - : 2 \ m 

am^i + a 2 £ 2 + ■■■ + a m £„ - z„ = 
The linear form <I> which must be minimized is equal to 

Linear programming methods make it possible to select, by a 
comparatively small number of successive trials, variables t, u t, 2 , - • • , £ m 
which satisfy the set requirements. For greater clarity we shall 
demonstrate the application of these methods by solving some 
particular games. 
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EXAMPLE 1. Find the solution of the 3x3 game given in 
Example 2 of Section 1, with the following matrix: 



^^~\ B 

4 ^^v. 


B, 


B, 


fij 


^ 


2 


-3 


4 


A 2 


-3 


4 


-5 


A* 


4 


-5 


6 



To make all the a i} nonnegative, add L= 5 to each element of the 
matrix. We get the matrix 



^~^\ B 

4 ^~^\^ 


B l 


B 2 


s 3 


Ax 


7 


2 


9 


<*2 


2 


9 





A> 


9 





11 



Here the value of the game increases by 5 while the solution 
remains unchanged. 

We determine the optimal strategy S% Conditions (5.2) are of 
the form 

7^ +2^+9^1 
9$,+ H^3>1 



(5.6) 



where £, = p,/v; ^ 2 = p 2 /v; ^ 3 = Pa/v. 

To get rid of the inequality signs, we introduce the dummy 

variables z u z 2 , z 3 ; conditions (5.6) will be written as 

7^+2^+9^-^ = 1 

2^+9^ 2 -2 2 = 1 J- (5.7) 

9^1+ 11^3-23 = 1 
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The linear form O is of the form 

and is to be minimized. 

If each of B's three strategies is a "utility" one, then each of the 
three dummy variables, z,, z 2 , z 3 , will vanish (i. e. a payoff 
equal to the value of the game, v, will be attained for each 
strategy Bj). But for the present we have no grounds for 
asserting that all the three strategies are "utility" strategies. To 
check this, we shall try to express the form <t> in terms of the 
dummy variables z u z 2 , z 3 , and see if, by putting them zero, the 
minimum of the form can be obtained. To do this we solve 
equations (5.7) for the dummy variables % u i; 2 > ^3 (i- e - express 
£i» ^2> %3 m terms of the dummy variables z,, z 2 , Z3): 



^> t tr\ on 



20 



^ = W + 



99 11 81 

80" Zl + lo * + ~^' 3 



"40 



z. + 



Z2 - 



* l , 81 

^ 3 " 20~ + W Zl 



80 

_9_ 
40 

59 



Z3 



~40 Z2 ~W Z3 



(5.8) 



Adding £,,', £ 2 , £3 we have 



* 1 1 1 l 



(5.9) 



In the expression (5.9) the coefficients of all the z's are positive; 
it means that any increase of z,, z 2 , z 3 above zero will result 
only in an increase of the value of the form <t>, and we want it to be 
minimal. Consequently, the values of z u ^2. z 3 minimizing the form 
(5.9) are 

z, = z 2 = z 3 = 

Substituting them into formula (5.9) we find the minimal value 
of the form O: 



7v = y 5 



giving the value of the game 
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Substituting the zero values z,, z 2 , z 3 into formulas (5.8), we find 

SI = /20 S2 = /lO S3 = /20 

or, multiplying them by v, 

Pi = 'A P2 = V2 ?3 = 'A 

Thus the optimal strategy for A is found: 



c* _ 
""V/4 72 74, 

i. e. we should write the figure 1 one-fourth of the time, the 
figure 2 half of the time, and the figure 3 the remaining one- 
fourth of the time. 

Knowing the value of the game v = 5, it is possible to find our 
opponent's optimal strategy 



\4l 12 <?3 

using the familiar methods. To this end we shall employ any two 
of our "utility" strategies (for example, A 2 and A 3 ) and write the 
equations 

2«i +9(72=5 

9q l + U(l-q 2 -q l ) = 5 

thence 51=^3 = 74; l2 = l /i- Our opponent's optimal strategy 
will be the same as ours: 

(B, B 2 B 3 

''VU 7 2 74 

Now return to the original (untransformed) game. To do this it is 
only necessary to subtract from the value of the game, v = 5, 
the value L= 5 which was added to the matrix elements. We 
obtain the value of the original game v = 0. Consequently, the 
optimal strategies for both parties assure an average payoff equal to 
zero; the game is equally advantageous or disadvantageous to both 
parties. 

EXAMPLE 2. Sports club A has three alternative team lineups, 
A u A 2 , and A 3 . Club B has also three alternative lineups, 
B ly B 2 , and B 3 . Making an application for a contest, neither of the 
clubs knows which of the alternatives its opponent will select. The 
probabilities of club A winning with different alternative lineups of 
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the clubs' teams, which are approximately known from the experience 
of past matches, are given by the matrix. 





«. 


B, 


B\ 


Ax 


0.8 


0.2 


0.4 


A 2 


0.4 


0.5 


0.6 


A, 


0.1 


0.7 


0.3 



Find with what frequency the clubs should use each of the 
lineups in matches against each other to score the largest average 
number of victories. 

SOLUTION. The lower value of the game is 0.4; the upper value is 
0.6; the solution is sought in the range of mixed strategies. In 
order not to deal with fractions, we multiply all the matrix 
elements by 10; as a result the value of the game increases by a 
factor of 10 but the solution remains unchanged. The following matrix 
is obtained: 





a. 


B, 


B, 


Ax 


8 


2 


4 


A 2 


4 


5 


6 


A, 


1 


7 


3 



Conditions (5.5) are of the form 

8!;, +4^ + ^-z, = 1 
2^+5^+7^-22 = 1 
4^+6^ + 3^3-23 = 1 
and the minimum condition is 

* = Si + ^ + &3 = min 



(5.10) 
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Let us check if all of our opponent's three strategies are 
"utility" ones. We first suppose that the dummy variables z t , z 2 , 
z 3 are zero, and, to check this, solve equations (5.10) for !;,, S2 , S3 : 



si = 

52 = 

53 = 



10 




27 


136 


+ 


136 


12 




22 


136 




136 


8 


+ 


8 



136 



136 



*1 + 



136 

20 
136 

32 
136 



Z2 



-z 2 + 



23 
T36~ 

54 
136 

32 
736" 



(5.11) 



giving 



136<D = 30+ 13z, + 18z 2 -51z 3 



(5.12) 



Formula (5.12) shows that increasing the variables z, and z 2 in 
comparison with their supposed zero value can only increase <t>, 
whereas increasing the variable z 3 can decrease <1>. Care must be taken, 
however, in increasing z 3 , lest the variables st , S2 , £, 3 dependent 
on z 3 should be made negative in doing so. Therefore, set the 
variables z x and z 2 in the right-hand side of equations (5.11) 
to be equal to zero, and increase the variable z 3 to the permissible 
limits(until one of the variables \ u t, 2 , S3 vanishes). It can be seen from 
the second equality of (5.1 1) that increasing z 3 is "safe" for the variable 
q 2 , for this only makes it increase. As to the variables S1 and 
£3, they allow z 3 to be increased only to a certain limit. 
The variable q, vanishes when z 3 = 10 /2 3 ; the variable S3 vanishes 
before this when z 3 = l / A . Consequently, giving z 3 its maximum 
permissible value z 3 = '/* makes the variable S3 vanish. 

To check if the form cj> is minimized when z x = 0, z 2 = 0, 
S3 = 0, express the remaining (nonzero) variables in terms of 
z t , z 2 , c, 3 supposed to be zero. 

Solving equations (5.10) for !;,, \ 2 and z 3 , we get 



si = 



S2 = 



1 

"32 


+ 


5 
32"" 1 


- 


4 

32" 


z 2 


+ 


23 
32 


S3 


6 

J2 


- 


2 
"32"-" 


+ 


8 
"32" 


"2 


- 


54 
32 


S3 


8 
32 


+ 


8 

T2~' 1 


+ 




-2 


- 


136 

"32 


"S3 



54 



giving 

32<D = 7 + 3z, + 4z 2 + S3 (5.13) 

Formula (5.13) shows that any increase of the variables z,, z 2 , b3 
above their supposed zero values can only increase the value of 
the form <t>. Thus the solution of the game is found; it is given by the 
values 

z, = z 2 = S3 = 

giving 

SI = 111 S2 = /l6 z 3 = U 

Substituting into formula (5.13), we find the value of the game, v: 

32<D = 7 = 32 /v v = 32 / 7 
Our optimal strategy is 

The "utility" strategies (team lineups A x and A 2 ) should be played 
with frequencies of l /-, and 6 / 1 \ lineup A 3 should never be used. 
To find our opponent's optimal strategy, we can in the general 
case proceed as follows. We change the sign of the payoff, add t6 the 
matrix elements a constant value L to make them nonnegative, 
and solve the problem for our opponent in the same way as we 
did for ourselves. The fact that we already know the value of the 
game, v, to some extent simplifies the problem, however. 
Besides, in this particular case the problem is further simplified 
by the fact that the solution comprises only two "utility" strategies 
for our opponent, B, and B 2 , since the variable z 3 is nonzero and 
hence when strategy B 3 is played the value of the game is not 
attained. By choosing any of player A's "utility" strategies, for 
example A u it is possible to find the frequencies q t and q 2 . To do 
this we write the equation 

8^+2(1 -</,) = "A 

giving 

cix = "7 7 qi = 7 7 

Our opponent's optimal strategy is 

s*-( Bi B2 
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i. e. our opponent should not use lineup B 3 , and lineups B, 
and B 2 should be used with frequencies of 3 / 7 and 4 / 7 . 
Returning to the original matrix, we determine the value of 
the game Vo - »/t : 10 - 0457 

This means that with a large number of matches the number of club 
/4's victories will be 0.457 of all matches. 



6. Approximation Methods for Solving Games 

In practical problems there is often no need to find an 
exact solution of the game, it is enough to find an approximate 
solution giving an average payoff close to the value of the game. 
An approximate estimate of the value of the game v can be 
obtained from a simple analysis of the matrix and the determination 
of the lower (a) and upper (P) values of the game. If a and p are 
close, there is practically no need to search for an exact 
solution, it being sufficient to select pure minimax strategies. 
When a and P are not close, it is possible to obtain a practicable 
solution using numerical methods of solving games, one of which, 
the iteration method, we shall briefly describe here. 

The idea of the method is in short as follows. A "mental 
experiment" is performed in which opponents A and B play their 
strategies against each other. The experiment consists of a succession 
of elementary games each having the matrix of a specified game. 
The experiment begins with our (player /Ts) choosing arbitrarily 
one of our strategies, for example, A t . Our opponent responds to 
this with his strategy Bj which is the least advantageous to us, i. e. 
minimizes the payoff obtained when strategy A t is played. We respond 
to this move with our strategy A k which gives the maximum 
average payoff when our opponent plays strategy Bj. Then it is 
again our opponent's turn. He responds to our pair of moves, A t 
and A k , with his strategy B, which gives us the smallest average 
payoff when those two strategies (A ( and A k ) are played, and 
so on. At each step "of the iterative process each player responds 
to any move of the other player with a strategy optimal with 
respect to all his previous moves, which can be regarded as some mixed 
strategy in which the pure strategies are represented in proportions 
corresponding to the frequencies with which they are played. 

Such a method represents, as it were, a learning-by-experience 
model, each of the players testing by experience his opponent's 
mode of behaviour and trying to respond to it in the way most 
advantageous to himself. 
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If such a simulation of the learning process is continued long 
enough, then the average payoff in one pair of moves (elementary 
game) will tend to the value of the game, and the frequencies 
p u ..., p m \ (?!, ..., q„ with which the player's strategies occur 
in this experiment will approach the frequencies determining the 
optimal strategies. 

Calculations show that the convergence of the method is very 
slow; this is no obstacle, however, for high-speed computers. 

We shall illustrate the application of the iteration method by 
considerating as an example the 3 x 3 game solved in Example 2 
of the previous section. 

The game is given by the matrix 



A ^""\^^ 


B, 


B : 


B> 


*l 


8 


2 


4 


^2 


4 


5 


6 


Ay 


1 


1 


3 



Table 6.1 gives the first 18 steps in the iterative process. 
In the first column are given the serial numbers of elementary 
games (pairs of moves), «; the second column gives the numbers, /, 
of the strategies chosen by player A; the next three columns 
give the "cumulative payoffs" in the first n games when his opponent's 
strategies B ]( B 2 and B 3 are played. The minimum of these values is 
underlined. Next come the numbers, j, of the strategies chosen by 
A's opponent and, correspondingly, the cumulative payoffs in the « 
games when strategies A lt A 2 , A 3 are played; the maximum of these 
values has a bar above. The marked values determine either 
player's choice of a counter strategy. The last three columns give in 
succession the minimum average payoffs, v, equal to the minimum 
cumulative payoffs divided by the number of games, n; the maximum 
average payoffs, v, equal to the maximum cumulative payoffs 
divided by n, and their arithmetical means v* = 



As n increases all the values, v, v, and v*, will tend to 
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Table 6.1 



n 


i 


B, 


s. 


«, 


i 


*\ 


/(, 


*> 


V 


V 


V* 


1 


3 


I 


7 


3 


I 


8 


4 


1 


1 


8 


4.50 


2 


1 


9 


9 


7 


3 


12 


10 


4 


3.50 


6.00 


4.75 


3 


1 


17 


U 


11 


2 


14 


15 


11 


3.67 


5.00 


4.33 


4 


2 


21 


16 


17 


2 


16 


2~0 


18 


4.00 


5.00 


4.50 


5 


2 


25 


2] 


23 


2 


18 


2~5 


25 


4.20 


5.00 


4.60 


6 


2 


29 


26 


29 


2 


20 


30 


32 


4.33 


5.33 


4.82 


7 


3 


3_0 


33 


32 


1 


28 


34 


33 


4.29 


4.86 


4.57 


8 


2 


34 


38 


38 


1 


36 


3"8 


34 


4.25 


4.75 


4.50 


9 


2 


38 


43 


44 


1 


44 


42 


35 


4.23 


4.89 


4.56 


10 


1 


46 


45 


48 


2 


46 


47 


42 


4.50 


4.70 


4.60 


11 


2 


50 


50 


54 


1 


54 


51 


43 


4.55 


4.91 


4.72 


12 


1 


58 


52 


58 


2 


56 


56 


50 


4.33 


4.66 


4.49 


13 


2 


62 


57 


64 


2 


58 


61 


57 


4.38 


4.70 


4.54 


14 


2 


66 


62 


70 


2 


60 


66 


64 


4.43 


4.71 


4.56 


15 


2 


70 


67 


76 


2 


62 


71 


71 


4.47 


4.73 


4.60 


16 


3 


71 


74 


79 


1 


70 


75 


72 


4.44 


"4.69 


4.56 


17 


2 


75 


79 


85 


1 


78 


79 


73 


4.41 


4.65 


4.53 


18 


2 


79 


84 


91 


1 


86 


83 


74 


4.39 


4.78 


4.58 



the value of the game, v, but the value v* will naturally tend 
to it comparatively more rapidly. 

As can be seen from the example, the convergence of the iterations 
is very slow, but all tho same even such a small computation 
enables one to find an approximate value of the game and 
bring to light the predominance of "utility" strategies. The value of 
the method considerably increases when computers are used. 

The advantage of the iteration method of solving games is that the 
amount and complexity of calculations increase comparatively 
weakly as the numbers of strategies, m and n, increase. 



7. Methods for Solving Some Infinite Games 

An infinite game is a game in which at least one of the parties 
has an infinite set of strategies. General methods for solving such 
games are still weakly developed. Some particular cases allowing 
comparatively simple solutions may be of interest in applications, 
however. 

Consider a game between two opponents, A and B, each having 
an infinite (uncountable) set of strategies; for player A these 
strategies correspond to different values of a continuously varying 
parameter x, and for B, to those of a parameter y. In this case, 
instead of being given by matrix | a y || the game is defined by some 
function of two continuously varying arguments, a(x, y), which we 
shall call a gain function (notice that the function a(x, y) itself 
need not be continuous). The function a(x, y) can be represented 
geometrically by some surface a (x, y) over the range of the arguments 
(x, y) (Fig. 7.1). 

The analysis of a gain function a(x, y) is done in a way similar 
to the analysis of a payoff matrix. One first finds the lower 
value of the game, a; to this end one determines for each x 
the minimum of the function a(x, y) over all y: 

min a (x, y) 

y 

then the maximum of these values over all x (maximin) is sought: 
a = max min a (x, y) 

x y 

The upper value of the game (the minimax) is determined in a 
similar way: 

p = min max a (x, y) 

y " 
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Consider the case when a = p. Since the value of the game, v, 
always lies between a and p, it is their common value, that is v. 
The equality a = p implies that the surface a (x, y) has a saddle 
point, i. e. a point with coordinates x , j>o at which a(x, y) is 
simultaneously minimal with respect to y and maximal with respect 
to x (Fig. 7.2). 




Fig. 7.2 



The value of a(x, y) at that point is precisely the value of the 
game, v: 

v = a(x , y ) 
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The presence of a saddle point means that the given infinite game 
has a solution in the range of pure strategies; x , y are 
optimum pure strategies for A and B. In the general case where a ^ p 
the game may have a solution (possibly not a unique one) only 
in the range of mixed strategies. A mixed strategy for infinite 
games is some probability distribution for the strategies x and y 
regarded as random variables. This distribution may be continuous 
and determined by the densities/, (x) and/ 2 Cv); or it may be discrete, 
in which case the optimal strategies consist of a set of separate 
pure strategies chosen with some nonzero probabilities. 

When there is no saddle point in the infinite game, it is 
possible to give a graphic geometrical interpretation to the lower 
and the upper value of the game. Consider an infinite game 
with a gain function a(x, y) and strategies x, y filling continuously 
the segments of the axes (x h x 2 ) and (y lf y 2 ). To determine 
the lower value of the game, a, it is necessary to "look" at the 
surface a(x, y) from the _y-axis, i. e. project it on the xOa plane 
(Fig. 7.3). We get some figure bounded by the straight lines 




Fig. 7.3 



x = Xi and x = x 2 at the sides and by the curves K u and Kj from above 
and below. The lower value of the game, a, is evidently the maximum 
ordinate of the curve K v Similarly, to find the upper value of the 
game, p, it is necessary to "look" at the surface a(x, y) from the 
Ox-axis (to project the surface on the yOa plane) and to find 
the minimum ordinate of the upper bound K u of the projection 
(Fig. 7.4). 
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Consider two elementary examples of infinite games. 
EXAMPLE 1. Players A and B have each an uncountable 
set of possible strategies x and y, with < x < 1, < y < 1. 




Fig. 7.4 



a(x,y) 




Fig. 7.5 

The gain function is given by the expression 
a(x, y) = {x-yf 

Find the solution of the game. 

SOLUTION. The surface a(x, y) is a parabolical cylinder 
(Fig. 7.5), and there is no saddle point in it. Determine the lower 
value of the game; evidently, min a(x,y) = for all x, hence 
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a = max mina(x, y) = 

x y 

Determine the upper value of the game. To this end find, with y 
fixed, 

max (x — y) 2 

In this case the maximum is always attained on the boundary 
of the interval (with x = or x = 1), i. e. it is equal to the greater 
of the values y 2 , (1 - y) 2 . Represent graphs of these functions 
(Fig. 7.6), i. e. the projection of the surface a(x, y) on the yOa 



1, 




a 
.1 




l^r y2 ll_y) Slt 






^^^^k 




1 1 


y='/2 


0~ 



Fig. 7.6 



plane. The bold line of Fig. 7.6 shows the function 
max (x — y) 2 . Its minimal value is evidently attained at y — '/i 
and is equal to 1 / A . Consequently, the upper value of the game 

P = l U- 

In this case the upper value of the game coincides with the 
value of the game, v. Indeed, player A can employ the mixed 



strategy S A = 



1 

7 2 7 2 



into which the extreme values x = and 



x = 1 enter with equal frequencies; then, whatever player B's 
strategy y, player ^'s average payoff will be 

It can easily be seen that this expression at any values of y 
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~y 2 + \(\-yf^'U 

Thus, by playing this mixed strategy player A can assure himself 
a payoff equal to the upper value of the game; since the value of the 
game cannot be greater than the upper value, this strategy S A is 
optimal : 

$A = Sa 

It remains to find player B's optimal strategy. 

It is obvious that, if the value of the game, v, is equal to the 
upper value of the game, P, then player B's optimal strategy 
will always be his pure minimax strategy assuring him the upper value 
of the game. Such a strategy in this case is y = i / 2 . Indeed, 
when this strategy is played, no matter what player A does, his 
payoff will not be greater than l /^. This follows from the obvious 
inequality 

(X- 1 /2) 2 =X(X-1)+ 74^74 

EXAMPLE 2. Side A ("we") are shooting at the enemy's aircraft B. 
To evade the fire the enemy's pilot may manoeuvre with 
g-load y to which he can arbitrarily assign a value from y = 
(rectilinear motion) to y = y m „ (flight along a circle of maximum 
curvature). Assume y m!XK to be a unit of measurement, i. e. set 
)W = l. 

In the fight against the enemy's aircraft we may use some 
sighting devices based on one or another hypothesis about the 
motion of the target during the flight of the projectile. The 
g-load x for this hypothetical manoeuvre can be set equal to any 
value from to 1. 

Our task is to hit the enemy; the enemy's task is to remain 
unhit. The probability of hitting for the given x and y can be 
approximated by the formula 

a(x,y) = pe- k(x -" )2 

where y is the g-load used by the enemy; x is the g-load allowed 
for in the sighting device. 

Determine the optimal strategies for both sides. 

SOLUTION. The solution of the game will evidently remain 
unchanged, if we put p = 1. The gain function a(x, y) is represented 
by the surface of Fig. 7.7. It is a cylindrical surface whose 
elements are parallel to the bisector of the coordinate angle 

64 



xOy and whose section by a plane perpendicular to an element is a 
curve of the normal distribuion type. 

Using the geometrical irmrpretation proposed above for the 
lower and the upper value of the game, we find p = 1 (Fig. 7.8) 



and a 



-k/4 



(Fig. 7.9). 



o(x,y) 




Fig. 7.7 
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Fig. 7.8 



There is no saddle point in the game; the solution must be 
sought in the range of mixed strategies. The problem is to an 
extent similar to that of the previous example. Indeed, at low 
values of k the function e~ Mv ~ v * 2 behaves approximately the way 
the function — (x - y) 2 does, and the solution of the game will be 
obtained if in the solution of the previous example players A and B 
are made to exchange their roles, i. e. our optimal strategy is the pure 
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strategy x = l / 2 , and the enemy's strategy 
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is to employ the extreme strategies y = and y = 1 with equal 
frequencies. This means that we should all the time use the sight 




Fig. 7.9 

designed for the g-load x = l / 2 , and the enemy should half of the 
time not use any manoeuvre at all and half of the time use the 
maximum permissible manoeuvre. 

It can easily be proved that this solution will be valid for 
the values of k < 2. Indeed, the average payoff when the opponent 
plays his strategy 



Sb = ( >/ 2 



and we play our strategy x is expressed by the function 



a{x) = —(e- kx 



+ e 



-Mi 



,f } 



which for the values of k < 2 has one maximum equal to the lower 
value of the game, a, when x = l / 2 - Consequently, playing 
strategy S B assures the enemy a loss not greater than a, from which 
it is clear that it is a, the lower value of the game, that is 
the value of the game, v. 

When fe > 2 the function a(x) has two maxima (Fig. 7.10) situated 
symmetrically with respect to x = i / 2 at the points x and 1 — x , 
the value of x depending on k. 

It is obvious that when k — 2 x = 1 — x = l / 2 ; as k is increased, 
the points x and 1 — x move apart tending towards the extreme 
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points (0 and 1). Consequently, the solution of the game wil| 
depend on k. Prescribe a specific value to k, for instance k = 1 
and find the solution of the game. To this end, determine the 
abscissa x of the maximum of the curve a(x). 



a(x) 



l/ 2 l-» 



Fig. 7.10 



Equating the derivative of the function a(x) to zero, write 
the following equation to determine x : 

xe- ix, = (l-x)e- 3 "-tf 

The equation has three roots; x = 1 / 2 (where the minimum is 
attained) and x , 1 - x where the maxima are attained. Solving 
the equation numerically, we find approximately 

x % 0.07 

1 - x * 0.93 

We shall prove that in this case the solution of the game is the 
following pair of strategies: 

Xq 1 — Xq 



c* _ 
■J A — 



•A 7 2 



s% = 



l h 7; 



When we play our strategy S^ and the enemy plays his strategy y 
the average payoff is 

fl , (y) = i.( e - 3(0.07 - ,f + g - 3(0.93 - tf ) 
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Let us find the minimum of a,(y) when 0<y< 1. The function 
a^ (y) is symmetrical in relation to y = 1 / 2 and can have only one or two 
maxima; its minimum is in any case attained either in the 
middle of the interval (0, 1) or at its ends. Putting y = we find 

Ul (0) = a, (1) = -^-(e- 3 * 007; + r""" ,: ) = 0.530 

Putting y = l / 2 we get 

a l { 1 /2) = e- 3 *W =0.574 

which is greater than a x (0); consequently, the value of the game 
is not less than a, (0): 

v > — (<?-3*S + e -3(i -xgj = 0530 

Now suppose that the enemy plays strategy S% and we play the strategy 
x. Then the average payoff will be 

a 2 {x) = ±(e-^ + e-W-*) z ) (7.1) 

But we have chosen x exactly so that for x = x the maximum of the 
expression (7.1) should be attained; consequently, 

a 2 {x) *S ^-(e- ix » + e" 3 " -*•?) = 0.530 

i. e. by employing strategy S% the enemy can avoid a loss greater 
than 0.530; consequently, v = 0.530 is exactly the value of the game 
and strategies S% and S^ give the solution. This means that we should 
use the sights with x = 0.07 and x = 0.93 with the same frequency, 
and the enemy should with the same frequency not manoeuvre and 
manoeuvre with the maximum a-load. 

Notice that the payoff v = 0.530 is appreciably greater than the lower 
value of the game » 

a = e -k/4 = e -0.75 =0472 

which we could assure ourselves by playing our maximin strategy 

*0='/2. 



One of the practical ways of solving infinite games is through 
approximately reducing them to finite ones. In doing so a whole 
range of possible strategies for each player is arbitrarily combined 
into one strategy. Of course, one can obtain only an approximate 
solution for a game in this way, but in most cases no exact 
solution is required. 
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One should bear in mind, however, that the use of this method 
may lead to solutions in the range of mixed strategies even in 
the cases where the solution of the original infinite game is 
possible in pure strategies, i. e. where the infinite game has a 
saddle point. If reducing an infinite game to a finite one 
yields a mixed solution which contains only two neighbouring "utility" 
strategies, then it is worthwhile attempting the in-between pure 
strategy of the original infinite game. 

Notice in conclusion that unlike finite games, infinite games 
must not necessarily have a solution. We shall cite an example of 
an infinite game which has no solution. Two players name each an 
integral number. The player who names a larger number gets a 
dollar from the other. If both name the same number, the game 
ends in a draw. The game evidently cannot have a solution. 
There are classes of infinite games, however, for which there is 
clearly a solution. It is possible to prove, in particular, that if in an 
infinite game possible strategies x, y for players A and B 
fill continuously some intervals and the gain function a(x, y) is 
continuous, then there is always a solution to the game (in pure or 
mixed strategies). 
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